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The  PDP-11/20  is  both  host  and  target  coaputer  for  the  L011  language  and 
coapiler.  It  is  operated  in  a single-user,  hands-on  environment.  Loll 
is  the  systea  iaplementation  language  anong  a planned  faaily  of 
languages  (HID11,  Hill)  and  a single-user  operating  systea,  collectively 
called  the  Evelyn  systea.  The  L011  compiler  and  the  operating  systea 
have  been  iapleaented,  but  the  other  coapilers  have  not. 

Currently,  L011  is  iapleaented  as  a single-pass,  recursive  descent 
coapiler.  However,  the  design  of  the  language  and  the  design  of  soae 
aspects  of  the  operating  system  reflect  our  intent  to  iaplenent  the 
Evelyn  coapilers  using  nultiple-pass  techniques  based  on  an  LR(1) 
p arser. 


Laflquaqg_facilities 

L011  is  a member  of  the  "PL"  class  of  languages,  based  on  the  ideas 
introduced  in  Mirth's  PL360  [6],  and  including  BLISS  [7],  C [3],  PL-11 
[5],  and  LIL  [2].  The  language  facilities  for  modular  construction  of 
programs,  and  for  control  of  the  sequence  of  execution,  are  high-level 
--  comparable  to  those  of  algebraic  languages.  The  language  facilities 
for  data  access  are  low-level  --  approximately  1:1  with  the  hardware 
instructions  of  the  target  computer,  comparable  to  an  assembler. 

Included  in  the  modular  facilities  are  the  declarations  of  nodules, 
procedures,  and  data,  and  access  restriction  statements  and  compile-tine 
definitions.  The  nodular  facilities  only  indicate  the  existence  of  data 
declarations;  the  details  of  data  declarations  are  included  in  the  data 
f acilities. 

Included  in  the  sequencing  facilities  are  conditional  statements  (IF, 
THEN,  ELSE,  CASE),  procedure  calls  and  loop  control  (CALL,  REPEAT, 
RETURN) , and  assignments. 

Included  in  the  data  facilities  are  the  details  of  data  declarations, 
the  PDP-11  instruction  set,  PDP-11  addressing  nodes  and  register  usage, 
etc. 
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The  nodular,  sequencing,  and  data  facilities  of  L011  ace  described  by  3 
separate  subgraaaars,  whose  union  forns  the  coaplete  grasaar  of  L011. 

The  subgraaaars  are  nearly  disjoint;  each  pair  of  subgraaaars  shares 
only  a few  non-terminal  syabols. 

The  modular  and  sequencing  subgraaaars  may  be  included  in  the  complete 
grammars  of  other  Evelyn  languages,  but  the  data  subgrammar  is  unique  to 
the  low-level  data  access  of  L011. 


Independence  from  syntactic  sugap 

The  graaaar  defines  a mapping  from  L011  source  statements  to  parse 
trees.  Parse  trees  are  napped  onto  abstract  syntax  trees,  which  carry 
structural  information  to  the  code  generators.  Any  change  in  the 
gramnar  would  result  in  a change  in  the  parse  trees.  Me  define 
syntactic  sugar  changes  to  be  those  changes  in  the  source  language  which 
can  be  made  not  to  cause  changes  in  the  abstract  syntax  trees. 

Me  consider  the  L011  language  to  be  describable  by  any  graaaar  which 
differs  at  nost  in  its  syntactic  sugar  from  the  qramaar  presented 
below.  In  practical  terns,  we  have  freed  ourselves  from  concern  over 
ainor  stylistic  issues,  without  significantly  increasing  the  burden  on 
the  compiler  iapleaentor. 
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$ Subgrammar  of  the  nodular  facility 


Program  = Module  ♦ ; 

Module  = 

Nodule_naae  ' : ' 

'MODULE*  »;* 

Module  body_elenent  ♦ 

•END*  ; 

Module_name  = ’Identifier'  ; 

Module_body_e lement 
= ACCESS_statement 
| CONSTA NT_st ate men t 
| S P EC IF Y_ statement 
| DECLARE_statement 
| Procedure 

t 

ACCESS_statement  = 

'ACCESS'  Accessed_names  'ONLY'  'FROM'  Scopes  ; 

A ccessed_na*es  = Accessed_ na me  { ' , ' Accessed_nane  ) * ; 

Accessed_name  = 'Identifier'  ; 

Scopes  = Scope  ( Scope  ) * ; 

Scope  = Entry_name  ( ! ' ('  Path  'THROUGH'  Path  ')  ' ) ; 

Entry_name  = 'Identifier'  ; 

Path  = Direction  Distance  'DISTANT'  ; 

Direction  = 'CALLEES'  | 'CALLERS'  ; 

Distance  = Co nstant_expression  | 'ARBITRARILY'  ; 

CON ST ANT_ statement  = 

'CONSTANT'  Name_of_consta nt  •=•  Consta nt_e xpression  *;'  ; 
Name_of_constant  = 'Identifier'  ; 

Const ant_expression  = Const_ numeric_e xpression 

{ Const_binary_loqic_op  Const_numeric_expression  ) * ; 

Const_numer ic_ex press  ion  = 

Const_terra  ( Const_binary_add_op  Const_term  ) * ; 

Const_term  = 

Const_factor  ( Con st_bina ry_mult_op  Const_factor  ) * ; 

Const_factor  = Const_pr imary  | Const_unar y_op  Const_factor  ; 
Const_prinary  = Value  | •('  Constant_expression  •) ' ; 
Const_binary_logic_op  = '&'  | ' | ' ; 

Const_binar y_add_op  = | '-'  ; 

Const_binary_ mult_op  = ' * ' | '/'  | '//'  ; 

C onst_unary_o p = ' - ' | '-'  ; 

S PECI FY_statement  = 'SPECIFY'  Specification  ';'  ; 

DEC LA  RE~statemen  t = 'DECLARE'  Declaration  ; 

Procedure  = 

Entry_name  ' : ' 

•PROCEDURE'  ( | Formal_inputs  ) ( | 'RETURNS'  Forma l_outputs  ) ' 

( CON ST AN T_ statement  | SPECIFY_statement  | DECLARE~statenent  ) 
Local_pr ocedu re  * 

Statement  ♦ 

• END'  ' ; ' ; 

Formal_inputs  = For mal_parameters  ; 

Formai_outputs  = Forraal_para meters  ; 

F ormal_pa rame ters  = '(•  Formal_name  ( ','  Pormal_name  )*')'; 
Formal_name  = 'Identifier'  ; 

Local_procedure  = 

Entry_name  • : ' 

'PROCEDURE' 

Statement  ♦ 

'BMP*  »;»  ; 


$ Subgraanar  of  the  sequencing  facility 

Statement  = IF_statement  | Sinple_stateaent  ; 

I F_sta  temen  t = 

'IF'  Logical_expressior 
'THEN'  Simple  statement 
( | ’ELSE ' Statement  ) ; 

Logical_expression  = Loqical_ter*  ( 'OR'  Logical_tera  ) * ; 

Logical_term  = Logical^f actor  ( 'AND'  Logical_factor  ) * ; 

Logical_f actor  = ’HOT*  * ( Test  | • ('  Loq ical_expression  ') ' ) ; 

Simpl e_statement 

= Assignment_statement 
| Group 

| CALL_sta tement 
| REPEAT_statement 
| RETURN_statement. 

| Eapty_stateaent 

f 

Assignaent_st  at.ement  = Assignment  ; 

G roup 

= ( I Label  • : • ) 

'DO'  ' ; • 

Statement  ♦ 

•END'  ' ;• 

| ( | Label  • : • ) 

•DO'  'CASE'  Datum  •;• 

( 'CASE'  Constants  Statement  ) ♦ 

•END*  •;• 

t 

Label  = 'Identifier'  ; 

Const  ants 

= Constant_oxpi:ession  ( ','  Constant_expression  ) * 

| 'DEFAULT7 
I 

C ALL_stateaent  = 

•CALL* 

Ent.ry_name  ( 1 Actua l_inputs  ) 

( | 'RETURNS'  Actual_outputs  ) ';'  ; 

Actual_inputs  = Actual_parameters  ; 

Actual_outputs  = Actual_parameters  ; 

Actua l_parame ters  = '('  Actual_parameter  ( Actual_parameter  ) * ')'; 

Actua l_pacameter  = | Passed_parameter  ; 

REPEAT  statement  = 'REPEAT'  ( | 'Identifier'  ) ; 

R ETURN~statem  en t = 'RETURN'  ( | ' FROM*  'Identifier*  ) ';'  ; 

Empty  statement  = : 


$ Subgrammar  of  the  data  facility 


V 


S pecification 

= ( | Data_r.ame  ) •[  • Size  • 

I Data_name  •[  • Specification  ( ','  Specification  ) * 

0 

Declaration 

= ( | Data_name  ) '[  • Size  • ( | Initialization  ) 

| Data_name  '[  • Declaration  ( ' Declaration  ) * •]' 

0 

Dat.a_name  = ‘Identifier’  ; 

Size  = Constant_expression  ; 

Initialization  = '<-•  Constant_expression  ; 

Passed_parameter  = Datum  ; 

Assignment  = Expression  ; 

E xpression 

= Prefix  Precision  Datum 
| Term 

I Expression  Infix  Precision  Term 
Prefix  = '- 1 | • -»*  ; 

Infix  = •<-•  | '♦'  | | *|‘  | | ’*•  | •/•  | •//'  I ‘\‘  | '\\' 

| •«•  | •«<’  | »>>•  | •»>»  ; 

Precision  = | 'W'  | 'B‘  ; 

Terra  = Datum  | •('  Expression  ') ' ; 

Datum 

= Register 
| »«#♦*  Register 
| '-a*  Register 

| 'a*  ' ('  Register  Value  ’)  ' 

J 'a*  Register 
| ‘aa*‘  Register 
| ‘-aa‘  Register 
I »aa'  ' ('  Register  Value  ')' 

| Value 
| 'a'  Value 
| 'aa‘  Value 
| ’NEGATIVE' 

| 'ZERO’ 

| 'OVERFLOW' 

J 'CARRY' 

0 

Register  = 'R'  '.'  Value  ; 

Value  = Nutner ic_constant  | Qualif ied_name  ; 

N umeric_constant 
= 'Constant' 

j Comp il e_time_f unct ion_name 

' ('  Con stan t_expr ession  ( ','  Constant_expr ession  ) ♦ 

0 

C ompi  le_time_f  unction_r.ame  = 'Identifier'  ; 

Quali f ied_narae  = 'Identifier'  ( '.'  'Identifier'  ) * ; 

Test 

= Comparand  Relation  Signed  Precision  Comparand 
| Datura  'S'  Precision  Datum  Relation  Datum 

0 

Comparand  = Datum  | 'LAST  RESULT'  ; 

Relation  * •=•  | | •<•  I •<=  • I •>*•  I •>•  ; 

Signed  = | 'U'  ; 
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There  is  a simple  compiler  control  language  by  which  a user  can  name  the 
files  of  LO 1 1 source  code  which  must  be  compiled  (in  the  order  they  were 
named)  to  create  any  particular  program.  The  compiler  control  language 
is  not  described  in  this  paper. 

Translation  of  a L011  program,  from  source  code  to  start  of  execution, 
is  accomplished  by  a compilation  step  and  a loading  step.  There  is  no 
linking  step  for  separately  compiled  nodules.  Conceptually,  each 
recompilation  of  a L011  program  is  a compilation  of  the  entire  program. 
External  to  the  L01 1 language,  there  is  an  optimization  mechanism 
embedded  in  the  L01 1 compiler,  the  Evelyn  text  editor,  and  the  Evelyn 
operating  system  which  permits  the  compiler  to  perform  a partial 
recompilation  of  only  those  source  files  which  have  been  changed  since 
the  last  preceding  partial  recompilation.  Our  existing  single-pass 
compiler  cannot  take  advantage  of  the  optimization  mechanism,  but  the 
m ulti pie- pass  compiler  will  be  able  to. 


S emantics.  modular  facility 

Each  module  (a  linguistic  entity)  is  stored  in  a separate  file  (an 
operating  system  entity). 

Access  restriction  statements  are  described  in  a separate  section  of 
this  report. 

Constant  expressions  are  evaluated  at  compile  time.  Subsequently  during 
compilation,  the  name  of  a constant  may  be  used  in  any  context  in  which 
the  constant  itself  could  be  used. 

Data  must  be  specified  or  declared  preceding  its  first  use.  Procedures 
should  be  declared  preceding  their  first  use,  although  the  compiler  does 
permit  a few  exceptions.  The  exceptions  should  be  restricted  to  those 
procedures  which  are  recursive.  Execution  begins  at  the  last  compiled 
procedure. 

Constants,  data  specifications  and  declarations,  and  procedure 
declarations  are  local  if  they  appear  within  the  declaration  of  some 
procedure;  otherwise  they  are  qlobal.  Local  procedures,  or 
subroutines,  are  para  meterless  and  cannot  contain  any  declarations. 

Global  declared  variables  are  allocated  fixed  storage.  Local  declared 
variables  and  parameters  to  procedures  are  allocated  storage  on  the 
stack.  Specified  variables  are  not  allocated  storage  at  compile 
time,  but  the  specifications  are  used  in  the  implementation  of  access 
algorithms.  Global  declared  variables  can  be  initialized  at  compile 
time. 

There  is  a clear  distinction  between  input  and  output  parameters  to  a 
procedure.  Within  the  body  of  the  procedure,  all  parameters  may  be 
used  in  exactly  the  same  manner  as  local  variables.  The  difference  is 
that  formal  input  parameters  are  initialized  to  their  actual  input 
values  during  the  invocation  of  the  procedure,  and  the  contents  of 
formal  output  parameters  are  copied  to  their  actual  output  variables 
during  the  return  from  the  procedure. 
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Case  statements  are  implemented  such  that  the  case  values  aust  be 
saall  non-negative  integers. 

Prograa  blocks  are  procedure  bodies  or  group  statenents  (DO,  DO  CASE). 
Repeat  and  return  stateaents  cause  execution  to  go  to  the  beginning  or 
the  end  of  prograa  blocks.  The  identifier  in  a repeat  or  return 
stateaent  must  name  the  label  of  a program  block  which  statically 
encloses  the  repeat  or  return  statemi  it.  In  the  absence  of  an 
identifier,  the  inneraost  enclosing  prograa  block  is  repeated  or 
returned  froa. 


Semantics,  data  facility 

Each  procedure  parameter  consists  of  exactly  one  2-byte  word. 

Specified  or  declared  data  is  typeless.  Only  the  size  (in  bytes)  is 
designated.  Access  to  the  various  fields  of  structured  data  is 
described  by  using  gualified  naaes. 

Expression  evaluation  is  performed  from  left  to  right,  with  no  hier- 
archy except  that  imposed  by  parenthesization.  The  leftmost  datum  in 
an  expression,  or  in  a parenthesized  subexpression,  acts  as  the 
destination  operand  of  every  operation  executed  during  expression 
evaluation.  Examples: 

1,0 11 10l.i_eguiv§ient A^GQL  equivalent 

a VI  ♦ 5 ; VI  :=  VI  ♦ 5; 

a VI  ♦ a V2  ♦ 5;  a VI  ♦ a V2 ; a VI  ♦ ^ ; VI  : = VI  ♦ V2  ♦ 5; 

Word  or  byte  precision  nay  be  designated  explicitly  for  each  operation, 
or  default  precision  aay  be  used.  If  the  leftmost  operand  is  of  size  = 

1 byte,  then  byte  precision  is  the  default.  In  all  other  cases,  word 
precision  is  the  default. 
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The  following  table  shows  the  L01 1 operators,  and  the  set  of  PDP-11/20 
instructions  which  are  used  to  implement  the  operators.  The  exact 
PDP-11/20  instruction  used  depends  on  the  value  of  the  operand  to  the 
right  of  the  operator,  and  depends  on  the  precision. 

Opera toL  PDP-11/gO  instructions 


prefix 

infix 


- 

NEG 

-1 

CON 

<- 

MOV,CLR 

ADD, ADC, INC 

- 

SUB, SBC, DEC 

1 

BIS 

6 -• 

BIC 

* 

macro:  multiply  in  Extended  Arithmetic  Element 
peripheral  hardware 

/ 

macro:  divide  in  EAE, 

retain  quotient 

// 

macro:  divide  in  EAE, 

retain  remainder 

\ 

macro:  reverse  divide 

in  EAE,  retain  quotient 

w 

macro:  reverse  divide 

in  EAE,  retain  remainder 

<< 

multiple  ALS 

<« 

multiple  ROL 

>> 

multiple  ARS 

>» 

multiple  ROR 

(EAE) 


As  an  aside,  the  left  operand  was  chosen  to  be  the  destination 
than  the  right  operand,  only  so  that  the  operator  would  be 
intui  ti  ve. 


ra  ther 


The  following  table  shows  datum  syntax  and  the  usual  PDP-11/20 
addressing  node.  In  some  context-dependent  special  cases,  the 
compiler's  optimizers  will  change  the  addressing  mode  or  the 
instruction.  "a)"  means  "contents  of". 

Datum LI§li3i_dddressing_mode 


Register 
' d * ' R egist  er 
'-3'  Register 
'i'  ' ('  R eg  ister 
'a'  Register 

• aa*'  Register 
'-aa'  Register 

• da • ' ( • Register 
Value 

• a ' Value 


' aa ' Value 


• NEGATIVE', 'ZERO' 


Register 
A utoincremen  t 
Auto  decrement 
Va lue  ' ) ' I nde  xed 

Register  deferred 
A utoincremen t deferred 
Auto  deer ement  deferred 
Value  ')'  Indexed  deferred 
Immediate 

Absolute  (for  global  variables)  , or 
Indexed  by  the  stack  register  (for 
parameters  and  local  variables) 
Relative  deferred  (for  global  variables), 
or  Indexed  by  the  stack  register 
deferred  (for  parameters  and  local 
variables) 

' CAR  i.y  ' Condition  code  bits 


In  a formal  sense,  a register  is  identified  by  its  numeric  value.  But 
in  practice,  a named  constant  is  used  for  the  value  of  the  register,  so 
the  register  can  be  identified  by  the  functional  role  for  which  it  is 
used.  Example: 

DECLARE  BUFFER  [ 72  ]; 

CONSTANT  CURSOR  = 3; 

R. CURSOR  <-  BUFFER; 


10 


If  an  expression  consists  of  a single  tern  of  constant  value,  that 
constant  value  is  compiled  in  line  with  the  surrounding  code.  This 
provides  a mechanism  for  implementing  some  PDP-11/20  instructions 
(HALT,  BESET,  WAIT,  RTI,  BPT)  which  most  easily  are  described  as  naned 
constants.  It  also  provides  an  escape  mechanism  for  generating  code  not 
otherwise  describable  in  L011.  For  example,  the  code  implementing  the 
L011  conpiler  contains  (exactly)  one  GO  TO  which  violates  the  nesting 
rules  of  program  blocks. 

If  an  expression  consists  of  a single  variable  tern,  that  variable  is 
accessed  with  a PDP-11/20  TST  or  TSTB  instruction. 

All  compile-time  values  are  constant.  A guoted  constant  'Constant'  is  a 
constant  recognized  by  the  compiler's  scan  mechanism.  A 'Constant'  may 
be: 

an  unsigned  decimal  integer  in  the  interval  0 <=  integer  <=  32767 
'#'  followed  by  an  unsigned  octal  integer  in  the  interval 
#0  <=  integer  <=  #177777 

'%'  followed  by  an  unsigned  binary  integer  in  the  interval 
%0  <=  integer  <=  % 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
a quoted  string  of  0,  1,  or  2 characters,  delimited  by  ""  or  by  " 
(longer  quoted  strings  may  be  used  only  for  initializing  global 
declared  data) 

The  compiler  has  the  built-in  compile-time  functions: 

S IZE ( Qualif ie d_na me)  Value  = size  (in  bytes)  of  specified 

or  declared  data. 

E ND_L0C (Quali f ied_name)  Value  = 1st  loc.  after  end  of 

specified  or  global  declared  data. 
All  specified  data  is  assumed  to 
start  at  loc.  = 0. 

BITS (Constant_ex pression)  Value  = bit  count  of  a mask  for  the 

binary  representation  of  the 
value  of  the  expression. 

CONST_VAL(JE  (Cor.stan t_expression)  Value  = value  of  the  expression. 

Permits  a constant  expression  to  bo 
used  as  a single  numeric  constant, 
without  having  to  assign  a name  to 
the  expression  using  a constant 
sta  temen  t. 

Unmodified  relations  are  determined  between  15-bit.  signed  comparands. 
Relations  modified  by  'U'  are  determined  between  16-bit  unsigned 
com  pa  rands. 

The  comparand  LA ST_ RESULT  utilizes  the  PDP-11/20  condition  codes  which 
were  set  as  a side  effect  of  a previously  executed  instruction.  If 
L A ST_ RESULT  is  used  as  a comparand,  the  other  comparand  must  be  a 
(constant)  value  = 0. 

A test  described  by 

Da  turn  '£'  Precision  Datum  Relation  Datum 
is  a comparison  of  bits,  implemented  by  the  PDP-11/20  BIT  or  BITB 
instruction.  The  3rd  datum  described  above  must  be  a (constant) 
value  = 0. 
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Constraints  on  user  pegging 

In  order  that  the  side  effects  of  any  stated  computation  can  be  under- 
stood completely,  L01 1 clearly  grants  authority  for  controlling  the 
target  computer. 

The  PDP-11/20  hardware  interrupt  mechanism,  and  the  high-level  compiled 
code  (CALL,  REPEAT,  RETURN,  IP,  THEN,  ELSE,  CASE)  can  change  the  run- 
time contents  of  the  stack  register  (R.6)  and  the  program  counter  (R. 7). 
The  user's  program  cannot  change  the  contents  of  these  registers  by 
explicit  assignment  statements. 

The  user's  assignment  statements  can  change  the  run-time  contents  of  the 
other  registers  (R.O  to  R.  5)  and  the  declared  variables.  The  high-level 
compiled  code  does  not  change  these,  except  that  formal  output 
parameters  are  copied  into  actual  output  parameters  during  the  return 
from  a procedure. 

Partly  for  these  reasons,  L011  does  not  include: 
hierarchical  evaluation  of  expressions 
automatic  loop  control  statements  (FOR,  WHILE,  UNTIL) 
subscripted  variables 
I/O  statements 

memory  management  statements 

In  most  computer  environments,  user  programs  reguest  "system"  or 
"monitor"  services  by  executing  hardware  trap  instructions.  Any 
additional  parameters  for  the  trap  instructions  are  passed  on  the 
stack.  L01 1 forbids  the  user  to  expand  or  contract  the  stack  by 
explicit  assignment  statements,  thereby  precluding  effective  use  of  the 
trap  instructions.  In  our  single-user,  hands-on  environment,  there  is 
no  need  for  hardware  protection  against  the  actions  of  user  programs. 
Since  each  compilation  is  (in  principle)  a total  compilation,  there  is 
no  need  to  distinguish  "system"  procedures  from  "user"  procedures. 

System  services  are  reguested  by  conventional  procedure  calls. 


Access  restriction  statements 

Access  restriction  statements  grant  to  specific  procedures  the  use  of 
the  names  of  global  data,  global  named  constants,  or  global  procedures. 
This  is  the  protection  mechanism  which  L011  substitutes  for  the  nesting 
facilities  commonly  found  in  high-level  languages.  Access  to  the  name 
of  declared  data  (for  example)  may  be  granted  to  the  procedures  whose 
entry  names  appear  in  the  scope  of  the  access  restriction  statement. 

If  a path  also  appears,  access  also  may  be  granted  to  the  callers  or 
callees  of  the  named  procedure.  The  designated  distance  is  a count  of 
the  number  of  calls  of  calls  of  procedures,  leading  to  or  from  the 
named  procedure. 

Unrestricted  access  is  permitted  to  global  data,  global  named  constants, 
and  global  procedures  whose  names  do  not  appear  in  access  restriction 
statements. 

Access  restriction  statements  have  not  been  implemented  in  the  single- 
pass LO 1 1 compiler. 
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Example  program 

POUR_FUNCTION_IN  FIX_CALCULATOR  OF_OCTAL  INTEGERS: 

MODULE; 

$This  module  is  a complete,  self-contained  program  which  emulates  the 
(operation  of  a four-f unction  calculator.  The  principal  operational 
{characteristics  of  this  program  are: 

{1.  Integer  I/O  is  in  octal. 

{2.  I/O  is  performed  on  a TTY. 

{3.  Expressions  are  evaluated  from  left  to  right  with  no  hierarchy, 

{ using  infix  operators  and  ending  with 

{This  program  accepts  only  the  keys: 

{ the  digits  0 to  7 

$ ♦ - * / = 

{ C meaning  cancel  this  computation 

{Names  for  PDP-11  hardware  constants 
CONSTANT  BYTE  = 1; 

CONSTANT  WORD  = 2; 

$TOP_OF_TH E_ST ACK  must  be  the  first  declaration  of  the  program. 
{During  initialization,  the  stack  is  made  to  extend  downward  in 
{memory  from  TOP_OF_THE_STACK. 

DECLARE  TO P_OP_THE_ST  ACK  [0]; 

{Type  out  1 character  to  the  TTY,  synchronous  I/O 
TTY  OUT: 

PROCEDURE  (CHAR); 

{Unibus  addresses  of  TTY  output  registers 
CONSTANT  TTY_OUT_CSR  = #177564; 

CONSTANT  TTY~OUT_DATA  = TTY_OUT_CSR  WORD; 

{Idle  until  the  TTY  has  finished  any  previous  typing 
IF  d>  TTY_OUT  CSR  > = B 0 THEN  REPEAT; 

« TTY_0UT_D  AT  A <-  * CHAR; 

EN  D; 

T YPE_OUT_CR_LF: 

PROCEDURE; 

CONSTANT  ASCII_CAR_RET  = #15; 

CONSTANT  ASCII~ LIN E_ FEED  = #12; 

CALL  TTY_OUT (ASCII_CAR_RET)  ; 

CALL  TT  Y~OUT  ( ASCII^I  IN  E_  FEED)  ; 

END; 
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{Read  and  echo  1 character  typed  in  from  the  TTY,  synchronous  I/O 
TTY_IN: 

PROCEDURE  RETURNS  (CHAR)  ; 

SUnibus  addresses  of  TTY  input  registers 
CONSTANT  TTY_IN_CSR  = #177560; 

CONSTANT  TTY3lN_DATA  = TTY_IN_CSR  «■  WORD; 

$Mask  for  the  complement  of  a 7-bit  ASCII  character 
CONSTANT  ASCII_HASK_CO  MPLEHENT  = #177600; 

Sidle  until  a character  has  been  typed  in 

IP  d TTY_IN_CSR  >=B  0 THEN  REPEAT; 

d CHAR  <-  S~TTY_IN_DAT  A 6-.  ASCII_HASK_COHPL  EH  ENT; 

SEcho  the  character 
CALL  TTY  OUT  (d  CHAR)  ; 

END; 


T YPE_OUT_OCTAL: 

PROCEDURE  (INTEGER)  ; 

SA11  integers  are  typed  out  as  the  same  number  of  octal  digits 
CONSTANT  DIGITS_PER_OCTAL  INTEGER  = 6; 

CONSTANT  OCTAL  MASK  COMPLEMENT  = #370;  $One  byte  only 

CONSTANT  TOP_ 3~BITS~  = #160000; 

DECLARE  COU  NT_OF_DIGITS_REM  AINING  [WORD]; 

DECLARE  BUFFER_F0R_DIGITS  ( DIGITS_PER_OCT AL_INTEGER  ]; 

SDesignate  the  registers  used  in  this  procedure 
CONSTANT  ASCII_DIGIT  = 0; 

CONSTANT  BU F_CU RSOR  = 1; 

SThe  digits  are  generated  in  the  reverse  of  the  order  in  which 

Sthey  are  typed  out 

R.  BUF_Cl)RSOR  <-  80 FFER_FOR_ DIGITS ; 

{Generate  the  digits 

a COU NT _OF_DIGITS_R EH AINING  <-  D IGITS_P ER_OCT AL_INT EGER ; 

DO; 

SExtract  an  octal  digit,  but  do  not  convert  it  to  ASCII  yet 
« R.BUF_CURSOR  <-B  a)  INTEGER; 

R . BUF~CUR  SOR  &-.B  OCTA  L_MASK_CONPLEMENT; 

{Rotate  right,  discarding  any  carry  bits  that  are  rotated  in 
a INTEGER  »>  3 6-  TOP_3_BITS ; 

a COUNT_OF_DIGITS_REMAINING~-_1 ; 

IF  LAST_RESULT  > 0~THEN  REPEAT; 

END; 

{Type  out  the  digits 

a COUNT_OF_DIGITS_REMAINING  <-  DIGI TS_PER_0CT  AL_I NT  EGER ; 

DO; 

{Expand  the  byte  to  a full  word  and  convert  to  an  ASCII  octal 
{digit 

R. ASCII_DIGIT  <-B  -a  R.BUF_CURSOR  ♦ "0"; 

CALL  TTY_0U'T'  (R.  ASCI I_ DIG  IT)” ; 
a COU  NT_OF_DIGITS_R  EM  AIN ING  - 1; 

IF  LAST^RESULT  > 0 THEN  REPEAT; 

END; 

END; 


{Table  of  operator  characters 
DECLARE  OPERATORS  ( 


EQ 


( BYTE] 
[BYTE] 
[BYTE] 
[ BYTE  ] 
[ BYTE  ] 


<- 

O 

<- 

<- 

<- 


ii . « 

* 

***»*, 

"/  ”, 
"="  ]; 
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CALCULATOR: 

PROCEDURE; 

CONSTANT  TOP_3_BITS 

{Designate  the  registers  used  in  th 

CONSTANT  CHAP 

CONSTANT  OPERAND 

CONSTANT  .ACCUMULATOR 

CONSTANT  INDEX_OF_LAST_OPERATOF 

CONSTANT  INDEX~OF~NEXT~OPERATOR 

SUnibus  addresses  of  the  registers 

SEleaent  (EAR)  peripheral  hardware 

Saultiplication  and  division 


*160000 ; 


is  procedure 

0; 

1; 

2; 

3; 


in  the  Extended  Arithmetic 
device  for  perforaing  integer 


CONSTANT 

E A E 

DIVIDE 

= #177300; 

CONSTANT 

EAE 

AC 

= EAE  DIVIDE 

♦ 

WORD; 

CONSTANT 

EAE 

HO 

= EAE  DIVIDE 

♦ 

2*V0RD; 

CONSTANT 

EAE 

MULTIPLY 

= EAE  DIVIDE 

♦ 

3* WORD; 

CANCEL: 


R.  ACCUMULATOR 
R. IN  DEX_OF_NEXT_OPERATOR 
CON  ST_ V AL  UE  (5pER  ATOR  S . E\) 
CALL  TYPE_OUT  CR  LF; 
do; 

R. OPERAND 

R. INDEX_OP_LAST_OPEPATOR 

NEXT  CHAR: 


<-  0; 

<- 

- OPERATORS)  ; 

SNext  operand 
<-  0; 

<-  R . IND  EX_OF_NEXT_Q PER A TOR ; 


CALL  TTY_IN  RETURNS  (R.CHAR) ; 

IF  R.CHAR  = "C"  THEN  REPEAT  CANCEL; 
S EARC  H_FOR_OP  ERA  TOR : 

DO; 


R.INDEX_OF_NEXT_OPERATOR  <-  OPERATORS; 

DO; 

IF  J*  R . I NDEX_OF_NEXT_OPERATOR  =B  R.CHAR 
THEN  RETURN  FROh  S EA RC H_ FOR_ OPE R ATOR ; 

IF  R. INDEX_OF_NEXT_OPEP ATOR~<U  END_LJC  (OPERATORS) 
THEN  REPEAT; 

END ; 

Ilf  execution  passes  throuqh  here,  the  character  just 
Styped  in  is  not  an  operator. 
fConvert  fro®  ASCII  to  an  octal  digit 
R.CHAR  - "0"; 

IVerify  that  the  character  is  an  octal  digit 
IF  R.CHAR  Ml  7 
THEN 
DO; 

CALL  TTY_OUT ("?")  ; 

Signore  the  character 
REPEAT  NEXT_CHAR; 

END; 

STest  for  overflow 
IF  R.  OPERAND  G TOP_3_BIIS  -=  0 
THEN 
DO; 

CALL  TTY_OUT ("-")  ; 

REPEAL  CANCEL; 

END; 

{Accumulate  the  octal  digit  into  the  operand 
P. OPERAND  <<  3 | R.CHAR; 

REPEAT  NEXT  CHAR; 

END; 

1_£ 


' 


■ 


CASE 


CASE 


CASE 


CASE 


CASE 


SDeteraine  the  relative  loc-  of  the  character  in  the 
Stable  of  operator  characters 

F . INDEX_OF_NEXT_OPERATOR  - CONST_VALOE (OPEBATOBS  ♦ BYTE) 
DO  CASE” R» INDEX~ OF  LAST  OPERATOR; 

0:  ” $ ♦ 

R. ACCUMULATOR  ♦ R. OPERAND; 

Signore  possible  overflow 

Is  $ - 

R. ACCUMULATOR  - R. OPERAND; 

Signore  possible  overflow 

2:  t * 

DO; 

R. ACCUMULATOR  * R. OPERAND; 

S Test  for  overflow  in  the  EAE  peripheral  hardware 
IF  a EA E_MQ  >=  0 AND  d E AE_AC  -.=  0 OR 

« EAE_MQ  < 0 AND  a EAE_AC  -»=  #177777 
THEN 
DO; 

CALL  TTY_0UT  I 

REPEAT  CANCEL; 

END; 

END; 

3:  S / 

STest  for  divide  by  0 
IF  R. OPERAND  = 0 
THEN 
DO; 

CALL  TTY_OUT  ; 

REPEAT  CANCEL; 

END; 

ELSE  R. ACCUMULATOR  / R. OPERAND; 

4:  $ = 

R.  ACCUMULATOR  <-  R.  OPERAND; 

END; 

STest  whether  "="  just  was  typed  in 
TF  P.INDEX_OF  NEXT_OPER ATOR  = 

CONST_VALUE (OPERATORS.  EQ  - OPERATORS) 

THEN 

DO; 

CALL  TYPE_OUT_OCTAI (R. ACCUMULATOR) ; 

REPEAT  CANCEL; 


END; 

END;  $of  NFXT_CH AR 
REPEAT; 

END;  $of  next  operand 
END;  Sof  CANCEL 
END;  Sof  CALCULATOR 
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{Execution  starts  here.  This  must  be  the  last  compiled  procedure, 
Sand  it  must  not  have  any  parameters  or  local  variables. 

HAIN: 

PROCEDURE; 

CONSTANT  RESET_ INSTRUCTION  = #5 ; 

RESET_INST RUCTION; 

{Initialize  the  stack  register.  This  must  be  done  once  at  the 
{start  of  every  proqram,  before  any  procedures  can  be  called, 
{violated  the  normal  Loll  rules,  so  a kluge  is  necessary. 
#012706;  { NOV  #Next , R. ST ACK 

TOP  OP  THE_ST ACK ; 

CALL  CALCULATOR; 

{Procedure  CALCULATOR  never  returns. 

END; 

END; 
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Chara  cte^  ist  ics_of_  t he_co«£jijLe  t 


I be  single-pass  recursive  descent  compiler  aaint.ains  a string  file, 
containing  one  copy  of  each  identifier  and  each  quoted  string  that  has 
been  scanned.  In  a typical  large  compilation  (of  the  compiler  itself, 
or  of  the  operating  systea),  the  string  table  occupies  50000  bytes. 

This  necessitates  storing  the  string  table  on  a disk  file.  There  are  4 
buffers  in  sain  memory,  each  holding  one  block  of  the  string  table. 

Also  in  aain  aeaory  is  a 5-byte  reference  to  each  entry  in  the  table. 

The  syabol  table  is  entirely  in  aain  aeaory.  it  is  accessed  serially, 
in  order  to  avoid  the  extra  storage  that  would  be  required  if  any 
additional  structure  were  iaposed  on  the  symbol  table. 

As  is  typical  of  recursive  descent  compilers,  the  code  for  parsing, 
emitting,  and  optimizing  is  intermingled,  with  no  clean  separation  of 
f unction. 

The  speed  of  compilation  is  about  2400  lines  per  minute.  Most  of  the 
time  is  spent  performing  disk  I/O  for  the  strinq  table. 

It  has  become  customary  to  compare  the  size  and  speed  of  code  produced 
by  low-level  compilers,  with  code  for  the  same  program  produced  by  some 
mythical  programmer  writing  in  macro  assembler  language.  Since  such 
comparisons  can  be  only  wild  guesses  at  best,  it  is  more  meaningful  to 
indicate  what  further  improvements  could  be  made  by  hand  optimization  of 
the  code  generated  by  an  actual  compilation.  L01 1 compiled  code  is 
about  12%  larger  and  about  20%  slower  than  the  same  program  would  be 
after  careful  hand  optimization  of  the  code.  Almost  all  the  difference 
is  due  to  the  single-pass  operation  of  the  compiler,  which  forces 
compilation  of  JMP  instructions  for  most  forward  branching,  where  the 
shorter  and  faster  BR  instructions  would  suffice. 


Il£££ie!tc££ 

L011  has  been  used  to  implement  the  L011  compiler  itself,  the  Evelyn 
operating  system,  and  several  other  large  programs.  The  only  other 
language  alternative  for  implementing  systems  would  have  been  macro 
assembler  lanquage,  under  the  constraints  of  our  hardware  configuration. 

LOll  has  been  very  satisfactory  for  creating  PDP-11/20  systea  programs. 
However,  we  have  a Vector-General,  Inc.  graphics  display  connected  to 
our  PDP-11/20.  The  VG  display  has  its  own  special-purpose  CPU,  which 
shares  the  PDP-11/20  Unibus.  L01 1 never  was  intended  to  describe  the 
instruction  set  of  the  VG  display  CPU.  It  is  nuch  more  difficult  to 
write  VG  programs  in  L011  than  it  would  be  in  macro  assembler 
language.  Writing  programs  in  LOll  for  the  combination  of  the  PDP-11/20 
and  VG  display  is  feasible  only  because  very  few  VG  instructions  are 
compiled;  most  VG  instructions  are  created  dynamically  by  the  PDP-11/20 
at  run  time. 

The  LOll  compiler  was  much  more  difficult  to  create  than  we  had 
anticipated.  All  during  its  development  we  were  limited  by  the  size  of 
the  computer's  memory  (28K  words  - the  maximum  possible  for  a PDP-11/20) 
and  sometimes  we  were  limited  by  the  size  of  disk  storage  (1/2H  bytes, 
later  expanded  to  1M  bytes). 
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